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CLAIMS 



1. DNA molecule that codes for a protein of the TGF-6 
family and which comprises 

(a) the part coding for the mature protein and if 
desired further functional parts of the nucleotide 
sequence shown in SEQ ID NO. 1, 

(b) a nucleotide sequence corresponding to the 
sequence from (a) within the scope of the 
degeneracy of the genetic code, 

(c) a nucleotide sequence corresponding to an 
allelic derivative of one of the sequences from (a) 
and (b) , or 

(d) a nucleotide sequence hybridizing with one of 
the sequences from (a) , (b) or (c) 

provided that a DNA molecule according to (d) 
completely contains at least the part coding for a 
mature protein of the TGF-B family. 

2. Vector, 
wherein 

it contains at least one copy of a DNA molecule as 
claimed in claim 1. 

3. Host cell, 
wherein 

it is transformed by a DNA as claimed in claim 1 or 
by a vector as claimed in claim 2. 
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4. Host cell as claimed in claim 3, 
wherein 

it is a bacterium, a fungus, a plant or an animal 
cell. 

5. Protein of the TGF-6 family which is coded by a DNA 
sequence as claimed in claim 1. 

5. Protein as claimed in claim 5, 
wherein 

it has the amino acid sequence shown in SEQ ID 

NO. 2 or, if desired, functional parts thereof. 

7. Process for the production of a protein of the 
TGF-B family, 

wherein 

a host cell as claimed in claim 3 or 4 is cultured 
and the TGF-B protein is isolated from the cell 
or /and from the culture supernatant. 

8. Pharmaceutical composition, 
wherein 

it contains at least one protein as claimed in 
claim 5 or 6 as the active substance if desired, 
together with the usual pharmaceutical carrier 
substances, auxiliary substances, diluents or 
fillers. 

9. Pharmaceutical composition as claimed in claim 8 
for the treatment or prevention of damage to bone, 
cartilage, connective tissues, skin, mucous 
membranes, epithelium or teeth, for application in 
dental implants and for application in wound- 
healing and tissue regeneration processes. 
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10. Antibody or antibody fragments, 
wherein 

they bind to a protein as claimed in claim 5 or 6. 



DESCRIPTION 



The present invention concerns a new growth/differen- 
tiation factor of the TGF-B family and DNA sequences 
coding therefor. 

The TGF-B family of growth factors which includes BMP-, 
TGF- and inhibin-related proteins (Roberts and Sporn, 
Handbook of Experimental Pharmacology 95 (1990) , 419- 
472) is particularly relevant for a wide range of 
medical treatment methods and applications. These 
factors are suitable in methods which concern wound- 
healing and tissue regeneration. Furthermore several 
members from t *2 TGF-B family induce tissue growth, in 
particular growth of bones, and therefore play a crucial 
role in inducing the development of cartilage and bones. 

Wozney (Progress in Growth Factor Research 1 (1989) , 
267-280) and Vale et al (Handbook of Experimental 
Pharmacology 95 (1990) , 211-248) describe various growth 
factors such as those which are related to the BMP group 
(bone morphogenetic proteins) and the inhibin group. The 
members of these groups show significant structural 
similarities. The precursor of the protein consists of 
an amino-terminal signal sequence, a propeptide and a 
carboxy-terminal sequence of about 110 amino acids that 
are cleaved from the precursor and constitute the mature 
protein. In addition their members are defined by an 
amino acid sequence homology. The mature protein 
contains the most conserved sequences, in particular 
seven cysteine residues which are conserved among the 
family members. The TGF-B-like proteins are 
multifunctional, hormonally active growth factors. They 
also have related biological activities such as 
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chemotactic attraction of cells, promotion of cell 
differentiation and tissue-inducing capabilities such as 
cartilage-inducing and bone-inducing capabilities. The 
US Patent No. 5,013,649 discloses DNA sequences that 
code for osteo-inductive proteins that are denoted BMP-2 
and the US Patent Applications series No. 179 101 and 
170 197 disclose the BMP proteins BMP-1 and BMP-3. 
Moreover many types of cells are able to synthesize TGF- 
B-like proteins and practically all cells have TGF-B 
receptors . 

On the whole these proteins show differences in their j 
structure which leads to significant variations in their 
exact biological function. In addition they are found in 
a wide range of different types of tissue and at various | 
stages of development. As a result they can exhibit I 
differences with regard to their exact function e.g. the j 
required cellular physiological environment, their life- j 
span, their target sites, their requirements for | 
auxiliary factors and their stability against 
degradation. Thus, although a multitude of proteins have 
been described that exhibit a tissue-inductive and in 
particular osteo-inductive potential, their natural 

functions in the organism and - more significantly - j 

their medical relevance still have to be investigated in 

detail. It is thought to be highly probable that members 

of the TGF-B family are present that are still unknown 

which are important for osteogenesis or the 

differentiation/ induction of other types of tissue. A 

major difficulty in the isolation of these new TGF-B- 

like proteins is, however, that their functions cannot 

yet be described exactly enough to develop highly 

discriminating bioassays. On the other hand the expected 

nucleotide sequence homology to other members of the 

family is too low to enable a screening by classical 
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nucleic acid hybridization techniques. Nevertheless the 
further isolation and characterization of new TGF-6-like 
proteins is urgently required in order to provide 
further inducing and differentiation proteins that 
fulfil all the desired medical requirements. These 
factors could be used medically in the healing of 
lesions and the treatment of degenerative diseases of 
bones and/ or other types of tissue such as the kidney or 
the liver. 

A nucleotide and amino acid sequence for the TGF-J3 
protein MP-52 is stated in the Patent Application 
PCT/EP93/00350 in which the sequence corresponding to 
the mature peptide and a major portion of the sequence 
corresponding to the propeptide of MP-52 are given. The 
complete sequence of the propeptide MP-52 is not 
disclosed. 

The object on which the present invention is based is to 
provide DNA sequences that code for new members of the 
TGF-J3 protein family with raitogenic and/or 
differentiation-inductive e.g. osteo-inductive 
potential. The object of the present invention was 
therefore in particular to provide the complete DNA and 
amino acid sequence of the TGF protein MP-52. 

This object is achieved by a DNA molecule that codes for 
a protein of the TGF-6 family and which comprises 

(a) the part coding for the mature protein and if 
desired further functional parts of the nucleotide 
sequence shown in SEQ ID NO. 1, 

(b) a nucleotide sequence corresponding to a sequence 
from (a) within the scope of the degeneracy of the 
genetic code, 
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(c) an allelic derivative of a nucleotide sequence 
corresponding to one of the sequences from (a) and (b) 
or 

(d) a sequence hybridizing with one of the sequences 
from (a) , (b) or (c) 

provided that a DNA molecule according to (d) contains 
at least the part coding for a mature protein from the 
TGF-J3 family. 

Further embodiments of the present invention concern the 
subject matter of claims 2 to 10. Other features and 
advantages of the invention can be derived from the 
description of the preferred embodiments and figures. 
The sequence protocols and figures are now briefly 
described. 

SEQ ID NO. 1 shows the complete nucleotide sequence of 
the DNA coding for the TGF-B protein MP-52. The ATG 
start codon starts with nucleotide 640. The start of the 
mature protein begins after nucleotide 1782. 

SEQ ID NO. 2 shows the complete amino acid sequence of 
the TGF-J3 protein MP-52 which was derived from the 
nucleotide sequence shown in SEQ ID NO. 1. 

Figure 1 shows a comparison between the amino acid 
sequence of MP-52 and several members of the BMP protein 
family starting with the first of the seven conserved 
cysteine residues. * denotes that the amino acid is the 
same in all compared proteins; + denotes that the amino 
acid corresponds in at least one of the proteins 
compared to MP-52. 
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Figure 2 shows the nucleotide sequences of the 
oligonucleotide primers that were used in the present 
invention and a comparison of these sequences with known 
members of the TGF-6 family. M denotes A or C, S denotes 
C or G, R denotes A or G and K denotes G or T. 2a shows 
the sequence of the primer OD, 2b shows the sequence of 
the primer OID. 

The present invention encompasses at least the part 
coding for the mature protein and if desired further 
functional parts of the nucleotide sequence shown in SEQ 
ID N0.1 as well as sequences that correspond to this 
sequence within the scope of the degeneracy of the 
genetic code and allelic derivatives of such sequences. 
In addition the present invention also encompasses 
sequences that hybridize with such sequences provided 
that such a DNA molecule completely contains at least 
the part coding for the mature protein of the TGF-S 
family. 

The term "functional part" within the sense of the 
present invention denotes a protein part which is 
capable of acting for example as a signal peptide, 
propeptide or as a mature protein part i.e. it fulfils 
at least one of the biological functions of the natural 
protein parts of MF-52 . 

The region coding for the mature part of the protein 
extends from nucleotides 1783 - 2142 of the sequence 
shown in SEQ ID N0.1. If desired, the DNA molecule can 
also comprise further functional parts of the sequence 
shown in SEQ ID NO. 1, namely the nucleotide sequences 
coding for the signal or /and propeptide part. It is 
particularly preferred that the DNA molecule comprises 
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the sequence for the signal and the propeptide part and 
the part of the mature protein i.e. nucleotides 640-2142 
of the sequence shown in SEQ ID NO. 1. On the other hand 
the DNA molecule can also comprise functional signal 
or/ and propeptide parts from other proteins in addition 
to the part coding for the mature protein, in particular 
from other proteins of the TGF-6 family e.g. the above- 
mentioned BMP proteins. The respective nucleotide 
sequences are given in the references mentioned above to 
the disclosure of which reference is hereby being made. 

Moreover the present invention also encompasses a DNA 
molecule as defined above that contains a non-coding 
intron sequence between nucleotides 1270 and 1271 of the 
sequence shown in SEQ ID NO. 1. This intron sequence is 
contained in the plasmid SKL 52 (H3) MP12 which is 
deposited at the DSM and has the genomic nucleic acid 
sequence of MP-52. 

The invention also encompasses the cDNA sequence of the 
MP-52 protein coded by the phage X 15.1. This sequence 
starts with nucleotide 321 of SEQ ID NO. 1. 

Although the allelic, degenerate and hybridizing 
sequences which are encompassed by the present invention 
have structural differences due to slight changes in 
their nucleotide or/and amino acid sequence, the 
proteins coded by such sequences still essentially have 
the same useful properties that enable their use in 
basically the same medical applications. 

The term "hybridization" according to the present 
invention means the usual hybridization conditions, 
preferably conditions with a salt concentration of 6 x 
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SSC at 62 to 66°C followed by a one hour wash with 0.6 x 
SSC, 0.1 % SDS at 62 to 66°C. It is particularly- 
preferred that the term "hybridization" denotes 
stringent hybridization conditions with a salt 
concentration of 4 x SSC at 62 to 66 °C followed by a one 
hour wash with 0.1 x SSC f 0.1 % SDS at 62 to 66°C. 

Preferred embodiments of the present invention are DNA 
sequences as defined above that are obtainable from 
vertebrates, preferably mammals such as pigs, cows and 
rodents such as rats or mice and in particular from 
primates such as humans. 

A particularly preferred embodiment of the present 
invention is the sequence denoted MP-52 shown in SEQ ID 
NO. 1. The transcripts of MP-52 were obtained from 
embryonic tissue and code for a protein which has a 
considerable amino acid homology to the mature portion 
of BMP-like proteins (see Fig. 1) . The protein sequences 
of BMP 2 (=BMP2A) and BMP 4 (=BMP2B) are described by 
Wozney et al., Science 242 (1988), 1528-1534. The 
corresponding sequences of BMP5, BMP 6 and BMP7 are 
described by Celeste et al., Proc. Natl. Acad. Sci. USA 
87 (1990), 9843-9847. Several typical sequence 
homologies which are specific for known BMP sequences 
have also been found in the propeptide part of MP-52 
whereas other parts of the precursor part of MP-52 
exhibit considerable differences to BMP precursors. 

In addition the present invention concerns a vector that 
contains at least one copy of a DNA molecule according 
to the invention. The DNA sequence according to the 
invention is preferably operatively linked to an 
expression control sequence in such a vector. Such 
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vectors are suitable for the production of TGF-6-like 
proteins in stably or transiently transformed cells. 
Various animal, plant, fungal and bacterial systems can 
be used for the transformation and subsequent culture. 
The vectors according to the invention preferably 
contain sequences necessary for replication in the host 
cell and are autonomously replicable. Furthermore the 
use of vectors is preferred that contain selectable 
marker genes which can be used to detect transformation 
of a host cell. 

In addition the invention concerns a host cell which is 
transformed with a DNA according to the invention or 
with a vector according to the invention. Examples of 
suitable host cells include various eukaryotic and 
prokaryotic cells such as E. coli, insect cells, plant 
cells, mammalian cells and fungi such as yeast. 

In addition the invention concerns a protein of the TGF- 
6 family which is coded by a DNA sequence according to 
claim 1. The protein according to the invention 
preferably has the amino acid sequence shown in SEQ ID 
NO. 2 or if desired functional parts thereof and 
exhibits biological properties such as tissue-inductive 
in particular osteo-inductive or/and mitogenic 
capabilities that may be relevant for a therapeutic 
application. The above-mentioned characteristics of the 
protein can vary depending on the formation of 
homodimers or heterodimers. Such structures may also 
prove to be suitable for clinical applications. 

The biological properties of the proteins according to 
the invention, in particular the mitogenic and osteo- 
inductive potential can be determined for example in 
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assays according to Roberts et al. , PNAS 78 (1981), 
5339-5343, Seyedin et al., PNAS 82 (1985), 2267-2271 or 
Sampath and Reddi, PNAS 78 (1981), 7599-7603. 

Furthermore the present invention concerns a process for 
the production of a protein of the TGF-fl family which is 
characterized in that a host cell transformed with a DNA 
according to the invention or with a vector according to 
the invention is cultured and the TGF-B protein is 
isolated from the cell or/and culture supernatant. Such 
a process comprises culturing the transformed host cell 
in a suitable culture medium and purifying the TGF-B- 
like protein produced. In this manner the process 
enables the production of an adequate amount of the 
desired protein for use in medical treatment or in 
applications using cell culture techniques which require 
growth factors. The host cell can be a bacterium such as 
Bacillus or E. coli, a fungus such as yeast, a plant 
cell such as tobacco, potato or arabidopsis or an animal 
cell and in particular a vertebrate cell line such as 
MoCOS or CHO cell lines or an insect cell line. 

Yet another subject matter of the present invention is 
the provision of pharmaceutical compositions which 
contain a pharmaceutically active amount of a TGF-S-like 
protein according to the invention as the active 
substance. If desired, such a composition comprises a 
pharmaceutically acceptable carrier substance, auxiliary 
substance, diluent or filler. Such a pharmaceutical 
composition can be used in wound-healing and tissue 
regeneration as well as in the healing of damage to 
bones, cartilage, connective tissue, skin, mucous 
membranes, epithelium or teeth and in dental implants 
either alone or in combination with other active 
substances e.g. other proteins of the TGF-B family or 
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growth factors such as EGF (epidermal growth factor) or 
PDGF (platelet derived growth factor) . Moreover such a 
pharmaceutical composition can be used in the prevention 
of diseases such as the prevention of osteoporosis and 
arthrosis. 

Another possible clinical application of the TGF-6 
protein according to the invention is to use it as a 
suppressor of an immunoreaction to prevent rejection of 
organ transplants or its application in connection with 
angiogenesis. The pharmaceutical composition according 
to the invention can also be used prophylactically or in 
cosmetic surgery. In addition administration of the 
composition is not limited to humans but can also 
encompass animals and in particular pets. 

Finally the present invention also concerns an antibody 
that binds specifically to the proteins according to the 
invention or such an antibody fragment (e.g. Fab or 
Fab 1 ) . Processes for the production of such a specific 
antibody or antibody fragment are part of the general 
technical knowledge of an average person skilled in the 
art. Such an antibody is preferably a monoclonal 
antibody. Such antibodies or antibody fragments may also 
be suitable for diagnostic methods. 

It is intended to elucidate the invention further by the 
following example. 
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Example 1 
Isolation of MP-52 

1.1 Total RNA was Isolated from human embryonic tissue 
(8 to 9 weeks old) according to the method of 
Chirgwin et al., Biochemistry 18 (1979), 5294-5299. 
Poly(A+) RNA was separated from the total RNA by 
oligo (dT) chromatography according to the 
manufacturer's instructions (Stratagene Poly (A) 
Quick columns) . 

1.2 For the reverse transcription reaction 1 to 2.5 ng 
poly (A+) RNA was heated for 5 minutes to 65 °C and 
quickly cooled on ice. The reaction mixture 
contained 27 U RNA-Guard (Pharmacia) , 2 . 5 \iq oligo 
(dT) 12-18 (Pharmacia), 5 x buffer (250 mmol/1 
Tris/HCl, pH 8.5, 50 mmol/1 MgCl 2 , 50 mmol/1 DTT, 

5 mmol/1 of each dNTP, 600 mmol/1 KCl) and 20 U AMV 
reverse transcriptase (Boehringer Mannheim) per jug 
poly (A+) RNA. The reaction mixture (25 /il) was 
incubated for 2 hours at 42 °C. 

1.3 The deoxynucleotide primers OD and OID shown in 
Fig. 2 were prepared on an automatic DNA 
synthesizer (Biosearch) . The purification was 
carried out by denaturing polyacrylamide gel 
electrophoresis and isolating the main bands from 
the gel by means of isotachophoresis. The 
oligonucleotides were designed by comparing the 
nucleic acid sequences of known members of the TGF- 
B family and selecting regions with the highest 
conservation. A comparison of this region is shown 
in Fig. 2. In order to facilitate cloning both 
nucleotides contained EcoRI restriction sites and 
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OD additionally contained a Ncol restriction site 
at its 5 1 terminus . 

1.4 cDNA corresponding to 20 ng poly (A+) RNA was used 
as the starting material in the PCR reaction. The 
reaction was carried out in a volume of 50 /zl and 
contained 1 x PCR buffer (16.6 mmol/1 (NH 4 ) 2 S0 4 , 

67 mmol/1 Tris/HCl pH 8.8, 2 mmol/1 MgCl 2 , 
6.7 /imol/1 EDTA, 10 mmol/1 B-mercapto-ethanol, 
170 /ig/ml bovine serum albumin (Gibco) , 200 jumol/1 
of each dNTP (Pharmacia) , 30 pmol of each 
oligonucleotide (OD and OID) and 1.5 U Taq 
polymerase (AmpliTaq, Perkin Elmer Cetus) . The 
reaction mixture was overlayed with paraffin and 4 0 
PCR cycles were carried out. The products of the 
PCR reaction were purified by phenol/chloroform 
extraction and concentrated by ethanol 
precipitation. 

1.5 The PCR reaction product was cleaved with the 
restriction enzymes SphI (Pharmacia) and AlwNI 
(Biolabs) according to the manufacturer's 
instructions. 

1 . 6 The products of the restriction cleavage were 
fractionated by Agarose gel electrophoresis. After 
staining with ethidium bromide, uncleaved 
amplification products were cut out of the gel and 
isolated by phenol extraction. The DNA obtained was 
subsequently purified twice by phenol /chloroform 
extraction. 
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1.7 After an ethanol precipitation, a quarter or a 
fifth of the isolated DNA was reamplified using the 
same conditions as for the primary amplification 
except that the number of cycles was reduced to 13 . 
The reamplif ication products were purified, cleaved 
with the same enzymes as above and the uncleaved 
products were isolated from Agarose gels as 
elucidated above for the amplification products. 
The reamplif ication step was repeated twice. 

1.8 After the last isolation from the gel, the 
amplification products were cleaved by 4 units 
EcoRI (Pharmacia) under the conditions recommended 
by the manufacturer. A fourth of the restriction 
mixture was ligated into the vector pBluescriptll 
SK+ (Stratagene) cleaved with EcoRI. 24 clones were 
analysed further by means of sequencing after 
ligation. The sample cleaved with AlwNI and SphI 
resulted in a new sequence that was denoted MP-52 . 
The other clones mainly contained BMP 6 sequences 
and one contained a BMP7 sequence. 

The clone was completed up to the 3 ' end of the cDNA 
according to the method described in detail by Frohmann 
(Amplifications, published by Perkin-Elmer Corp., Issue 
5 (1990), pp 11-15). The same embryonic mRNA that had 
been used to isolate the first fragment of MP-52 was 
reversally transcribed as described above.. The 
amplification was carried out using the adapter primer 
(AGAATTCGCATGCCATGGTCGACG) and an inner primer 
(CTTGAGTACGAGGCTTTCCACTG) of the MP-52 sequence. The 
amplification products were reamplified using an 
overlapping adapter primer ( ATTCGCATGCCATGGTCGACGAAG ) 
and an overlapping internal primer 

(GGAGCCCACGAATCATGCAGTCA) of the MP-52 sequence. After 
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restriction cleavage with Ncol the reamplif ication 
products were cloned and sequenced into a vector that 
was cleaved in the same way (pUC 19 (Pharmacia No. 27- 
4951 — 01) having a modified multiple cloning site which 
contains a single Ncol restriction site) and sequenced. 
The clones were characterized by their sequence 
overlapping at the 3' end of the known MP-52 sequence. 
One of these was used as a probe to screen a human 
genomic gene bank (Stratagene No. 946203) according to a 
method described in detail by Ausubel et al. (Current 
Protocols in Molecular Biology, published by Greene 
Publishing Associates and Wiley-Interscience (1989)). 
One phage (A.2.7.4) was isolected from 8 x 10 5 A, phages 
which contained an insertion of about 20 kb and which is 
deposited at the DSM under the depository number 7387. 
This clone contains further sequence informations at the 
5' end in addition to the sequence isolated from mRNA by 
the described amplification methods. 

For the sequence analysis a Hindlll fragment of about 
7.5 kb was subcloned into a vector cleaved in the same 
manner (Bluescript SK, Stratagene No. 212206) . This 
plasmid denoted SKL 52 (H3) MP12 was also deposited at 
the DSM under the depository number 7353. The sequence 
information shown in SEQ ID NO. 1 was derived from the 
phage X 2.7.4.. The ATG at position 64 0 is the first ATG 
within the reading frame (a stop codon occurs at 
position 403). Based on the sequence data it may be 
assumed that this is the start codon for the 
translation. 

The genomic DNA contains an intron of about 2 kb between 
base pairs 1270 and 1271 of SEQ ID NO: 1. The sequence 
of the intron is not shown. The correctness of the 
splicing site was confirmed by sequencing an 
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amplification product which was derived from cDNA 
containing this region. These sequence informations were 
obtained using a slightly modified method which is 
described in detail by Frohmann (Amplifications, 
published by Perkin-Elmer Corporation, Issue 5 (1990) , 
pp 11-15) . The same embryonic RNA that was also used to 
isolate the 3' end of MP-52 was reversally transcribed 
using an internal primer orientated in the 5' direction 
of the MP-52 sequence ( AC AGCAGGTGGGTGGTGTGGACT ) . A 
polyA tail was attached to the 5' end of the first 
cDNA strand using terminal transferase. A two-step 
amplification was carried out, firstly by using a 
primer composed of oligo dT and an adapter sequence 
( AGAATTCGCATGCCATGGTCGACGAAGC ( T 1 6 ) ) and secondly an 
adapter primer (AGAATTCGCATGCCATGGTCGACG) and an 
internal primer (CCAGCAGCCCATCCTTCTCC) from the MP-52 
sequence. The amplification products were reamplified 
using the same adapter primer and an overlapping 
internal primer (TCCAGGGCACTAATGTCAAACACG) from the MP- 
52 sequence. Subsequently the reamplif ication products 
were reamplified using an overlapping adapter primer 
(ATTCGCATGCCATGGTCGACGAAG) and an overlapping internal 
primer (ACTAATGTCAAACACGTACCTCTG) from the MP-52 
sequence. The final reamplif ication products were cloned 
with blunt ends into a vector (Bluescript SK, Stratagene 
No. 212206) which had been cleaved with EcoRV. The 
clones were characterized by their sequence overlapping 
with the DNA of X 2.7.4.. 

In addition a cDNA bank - produced from RNA of 
fibroblasts and cloned into XgtlO - was screened. In 
this process 2 x 10 5 phages were tested using a ca. 1 kb 
fragment of genomic MP-52 DNA (2nd exon up to the 
Hindlll restriction site in the 3 1 untranslated region) 
as a radioactive probe. 17 mixed plaques were picked out 
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which were checked using primers from the 5' and 3' 
region of the MP-52 sequence. Subsequently 8 phage 
plaques were selected and isolated. cDNA was isolated 
from the phage by an EcoRI partial cleavage and cloned 
into the Bluescript vector that was also cleaved with 
EcoRI . 

Sequencing of one of the resulting plasmids 
SK52L15. 1MP25 showed that the longest phage (15.1) 
starts at nucleotide No. 321 of SEQ ID NO. 1. In 
addition the splicing position (nucleotide 1270) was 
confirmed by the sequencing. 

The plasmid SKL 52 (H3) MP12 was deposited on 10th 
December 1992 at the DSM ("Deutsche Sammlung von 
Mikroorganismen und Zellkulturen, Mascheroder Weg lb, 
3300 Braunschweig") under the depository number 7353. 

The phage X 2.7.4 was deposited on 13th January 1993 at 
the DSM under the depository number 7387. 

The plasmid SK52L15 . 1MP25 was deposited on 16th July 
1993 at the DSM under the depository number 8421. 



ABSTRACT 



The invention concerns a protein of the TGF-J3 family, 
the DNA coding therefor and a pharmaceutical composite 
containing the protein. 



SEQ ID NO.l 

TYPE OF SEQUENCE: nucleic acid sequence 
NAME AND ORIGIN: MP-52 gene 
LENGTH: 2703 nucleotides 



CCAJGQCCTCGftAAG33C2^^ 

TIXXITGAGITCAGGTTTG^ 
GTOTGTGTCTGTGIGIXJ^^ 

CGGTIQGJCTITICTCOT 
GTCTG3ATAa^3W32AT^ 



AAQGagmxxmxaoGcc^^ 

CAQCXX3W3AftaaTGAaCCC^^ 
CO^GGACACCCiaD0CA?«CA^ 



AC^XX^DSAGE^ATGCTC^^ 




GGAHCTGGCTGGGfiGGTGTICGACMCU 

CT3ITCX;ia?lUmUGCCGC 



CCCACGCGGCTSACTCCC^^ 
GICTICCTGQGTOGC&CA!^^ 




CCT 



SEQ ID NO. 2 

TYPE OF SEQUENCE: amino acid sequence 
NAME AND ORIGIN: MP-52 protein 
LENGTH: 501 amino acids 



MRLPKLLTFL LWYLAWLDLE FICTVLGAPD LGQRPQGTRP GLAKAEAKER 
PPLARNVFRP GGHSYGGGAT NANARAKGGT GQTGGLTQPK KDEPKKLPPR 
PGGPEPKPGH PPQTRQATAR TVTPKGQLPG GKAPPKAGSV PSSFLLKKAR 
EPGPPREPKE PFRPPPITPB EYMLSLYRTL SDADRKGGNS SVXLEAGLAN 
TITSFIDKGQ DDRGPWRKQ RYVFDISALE KDGLLGAELR ILRKKPSDTA 
KPAAPGGGRA AQLXLSSCPS GRQPASLLDV RSVPGLDGSG WEVFDIWKLF 
RNFKNSAQLC LELEAWERGR AVDLRGLGFD RAARQVHEKA LFLVFGRTKK 
RDLFFNEIKA RSGQDDKTVY EYLFSQRRKR RAPLATRQGK RPSKNLKARC 
SRKALHVNFK DMGWDDWIIA PLEYEAFBCE GLCEFPLRSR LEPTNHAVIQ 
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